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Abstract 

In this paper we consider a simple virus infection spread model on a 
finite population of n agents connected by some neighborhood structure. 
Given a graph G on n vertices, we begin with some fixed number of initial 
infected vertices. At each discrete time step, an infected vertex tries to 
infect its neighbors with probability /3 6 (0, 1) independently of others and 
then it dies out. The process continues till all infected vertices die out. 
We focus on obtaining proper lower bounds on the expected number of 
ever infected vertices. We obtain a simple lower bound, using breadth-first 
search algorithm and show that for a large class of graphs which can be 
classified as the ones which locally "look like" a tree in sense of the local 
weak convergence [I], this lower bound gives better approximation than 
some of the known approximations through matrix-method based upper 
bounds [3|- 
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1 Introduction 



1.1 Background and Motivation 

Often it is observed that the normal operation of a system which is organized in 
a network of individual machines or agents is threatened by the propagation of 
a harmful entity through the network. Such harmful entities are often termed as 
a viruses. For example the Internet, as a network is threatened by the computer 
viruses and worms which are self-replicating pieces of code, that propagate in 
a network of computers. These codes use a number of different methods to 
propagate, for example, an e-mail virus typically sends copies of itself to all 
addresses in the address book of the infected machine. Weaver et. al. [7] gives 
a good survey of different techniques of propagation for computer viruses. 

In this paper we use a simple susceptible infected removed ( SIR ) model which 
was studied by Draief, Ganesh and Massoulie in 2008 [3 . In this model, each 
susceptible agent, can be infected by its infected neighbors at a rate, propor- 
tional to their number and remains infected till it is removed after an unit time. 
While it is infected, it has the potential to infect its neighbors. In general, 
removal can correspond to a quarantining of a machine from the network or 
patching the machine. In this model, it is assumed that once a node is removed, 
it is "out of the network" . That is, it can no longer be susceptible or infected. 
Such a model is justified, provided the epidemic spread happens at a much faster 
rate than the rate of patching of the susceptible machines. 

The study of mathematical models for epidemic spread has a long history 
in biological epidemiology and in the study of computer viruses. One of the 
first work in this area was by Kermack and Mckendrick [5], where they estab- 
lished the first stochastic theory for epidemic spread. They also proved the 
existence of an epidemic threshold, which determines whether the epidemic will 
spread or die out. As mentioned in [3J, earlier work mainly focused on finding 
or approximating the law of large numbers limit where the stochastic behavior 
was approximated by its mean behavior and hence mainly studied determin- 
istic models. More recent works [2j [6], have focused on stochastic nature of 
the models and have tried to prove asymptotic distribution of the number of 
survivors, using a key concept called basic reproductive number Rq, which is de- 
fined as the expected number of secondary infective, caused by a single primary 
infective. This concept of basic reproductive number is well defined under the 
uniform mixing assumption, that is, when any infective can infect any suscep- 
tible equally likely, and hence the underlying network is given by a complete 
graph. For a general network, where basic reproductive number may become 
vertex dependent, it is not clear how to use this concept effectively. As in [3J, 
in this work we would like to study this model on a general network. 

1.2 Model 

We consider a closed population of n agents, connected by a network structure, 
given by an undirected graph G = (V, E) with vertex set V, containing all the 
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agents and edge set E. A vertex can be in either of the three states, namely, 
susceptible (S), infected (I) or removed (R). At the beginning, the initial set 
of infected is assumed to be non-empty and all others are susceptible. The 
evolution of the epidemic is described by the following discrete time model: 

• After an unit epoch of time, each infected vertex instantaneously tries to 
infect each susceptible neighbor with probability (3 e (0, 1) independent 
of all others. 

• Each infected vertex is removed from the network after an unit time. 

Mathematically, at an integer multiple of unit time, say t, if a susceptible vertex 
v has (t) neighbors who are infected, then the probability of v being infected 
instantaneously is 1 — (1 — j3) Iv ^ and each susceptible vertex gets infected inde- 
pendently. Also an infected vertex remains in the network only for an unit time, 
after that it tries to infect its susceptible neighbors and then it is immediately 
removed. 

As pointed out by [3J, this is a simple model, falling in the class of models 
known as Reed-Forest Models, where infection period is deterministic and is 
same for every vertex. It is worth noting that the evolution of the epidemic can 
be modeled as a Markov chain. 

It is interesting to note here that, the model is essentially same to the i.i.d. 
Bernoulli bond percolation model with parameter (3 [TU]. This is because the 
set of ever infected (or removed) vertices is same as the union of connected 
open components of i.i.d. bond percolation on G, containing all the initial 
infected vertices. Although for percolation, it is customary to work with an 
infinite graph G. If G is the complete graph K n , then this model is fairly well 
studied in literature which is known as the binomial random graph, also known 
as Erdos-Renyi random graph [H [IT] . 

Like in [3] , our goal is to study the total number of vertices that eventually 
become infected (and hence removed) without specifying the underlying net- 
work. In [3] , the authors derived explicit upper bounds to the expected number 
of vertices ever infected which depends on both the size of the network as well 
as the infection rate (3. These bounds also needed assumption of "small" value 
for (3. Unfortunately, the work [3] did not provide any indication of whether the 
derived upper bounds are good approximations to the exact quantity of interest. 
In this work we derive a simple lower bound to the expected number of vertices 
ever infected which works for every infection rate < f3 < 1. Our lower bound is 
based on the breadth-first search (BFS) algorithm and hence easily computable 
for any general finite network G. We also prove that, under certain assumptions 
on the qualitative behavior of the underlying graph, namely if G 11 locally looks 
like a tree" in the sense of Aldous and Steele [TJ local weak convergence, then 
our lower bound is asymptotically exact for "small" 13, thus provides a good ap- 
proximation when the network is "large" . As we will see later, for such graphs 
G, the range we cover for f3 always include the range in which the upper bounds 
from [3J works and in all these cases, the upper bound over estimates the exact 
quantity. 
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1.3 Outline 



In the following section, we state and prove our main results. Section [3] gives 
several examples where our lower bounds works and gives asymptotically correct 
answer. Finally in Section|3]we summarize the merits of work and indicate some 
of its limitations as well. 



2 Main Results and Proofs 

We will denote by Y G I , the total number of vertices ever infected when the 
epidemic runs on a network G and the infection starts at the vertices in / C V. 
Note that Y ' 1 implicitly depends on the size of the network. In Subsection 
12.11 we present the results, when the epidemic starts with only one infected 
vertex. We generalize these results for epidemic starting with more than one 
infection, which are presented in Subsection l2.2l In both cases, our results relay 
on a specific search algorithm, known as breadth-first search (BFS). We briefly 
describe the algorithm here. 

Step-0 Input graph G with a linear ordering of its vertices 

say V := {v , «i, «2, • • • ,v n -i) . Let T «- {v a } and N <- 
{vo}. 

Step-1 Write N = {vi ly Vi 2 , ■ ■ ■ ,Vi r } for some r > 1 such that i\ < 
%2 < ■ ■ ■ < i r ■ 

Step-2 For 1 = 1 to r find all neighbors u of which are not 
in T, put 



u ~ Vi, and u ^ T 



N' <- N' U | 
and update T as 

T <— T U |u u ~ Vi t and u $ T X . 
Step-3 Update N <- N' . 

Step-4 Go to Step-1 unless vertex set of T is same as that of 
V . 

Step-5 Stop with output T as the BFS spanning tree with root 
V Q . 

Note that the BFS spanning tree is not necessarily unique, it depends on the 
starting point vo which is typically called the root and also it depends on the 
ordering of the vertices in which the exploration of neighbors is done in Step-2. 
Also note that iff G is a tree to start with then, BFS spanning tree is just itself. 
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2.1 Starting with Only One Infected Vertex 



Our first result gives a lower bound to the expected total number of vertices 
ever infected starting with exactly one infected vertex. 

Theorem 1 Let G be an arbitrary finite graph and vq S V be a fixed vertex of it. 
Let T be a spanning tree of the connected component of G containing the vertex 
vq and rooted at vq. Let Y T '^ V °^ be the total number of vertices ever infected 
when the epidemic runs only on T and starting with exactly one infection at vq . 
Then 



E 



< E 



for all < (3 < 1 



(1) 



Moreover, if T is a BFS spanning tree of the connected component of vq rooted 
at vq , then 



E 



Y T '{ v o} 



< E 



Y r <i v o} 



< E 



yG,{n } 



for all < (3 <l. 



(2) 



Proof: First we observe that if H C G, is a spanning sub-graph of G and vo is 
a vertex in both H and G, then by definition y^dM < r G >»). This proves 

For the second part, we note that if T is a spanning tree of G with root Vo, 
then da (v, vq) < (v, vq) for all v where dc and dr are the graph distance 
functions on G and T respectively. Moreover, the BFS algorithm preserves the 
distances, so if 7" is a BFS spanning tree with root {vq} then we must have 
g?g (v, vq) = dj- {v, vq) for all v G V. Thus dj- {v, vq) < dr (w, «o) for all v G V. 
Now from the model description, it follows that for any spanning tree T with 
root vq we have 



E 



t(v,v ) 



So we conclude that 



E 



as < fi < 1. ■ 

Let LB G ' {,,o} := E [F r .{"o}] be the lower bound obtained through BFS 
algorithm for a BFS spanning tree T of G, rooted at vq. Then from the proof 
of Theorem [T] we get that 



LB 



G,{v } 



Hp 

vev 



d G (v,v ) 



(3) 



which is free of the choice of the BFS spanning tree. Later, we will see that, 
this helps us to generalize the lower bound for epidemic starting with more than 
one infected vertices at the beginning. We also note that LB G 'f" ^ can easily 
be computed using the breadth- first search algorithm described before. 
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Our next result shows that if we have a "large" finite graph G on n vertices 
and the epidemic starts with exactly one infected vertex vq, such that any 
cycle containing vo is "relatively large", that is of order O(logn), then the 
lower bound LB G '^ ^ given above, is asymptotically same as the exact quantity 

To state the result rigorously, we use the following graph theoretic notations. 
Given a graph G, a fixed vertex vq of G and d > 1 , let V d (G) be the set of vertices 
of G which are at a graph distance at most d from vq in G. Let Nd (G,vq) be 
the induced sub-graph of G on the vertices Vd (G) . 

Theorem 2 Let {(G„, ^ e a sequence of rooted graphs with roots {vq } n>1 

such that there exists a sequence a n — O (logn) with N an {G u ,Vq) is a tree for 
all n > 1. Then, there exists < fto < 1, such that for all < ft < fto 



E [yC, 



LB' 



G„,{v } 



1 as 7i — y oo . 



(4) 



Proof: Let T n be a BFS spanning tree rooted at Vq of the graph G n and as 
defined before let LB G "' { " o} = E [y^W] . Then 



LB G„,{M < E 

< E 

< LB 



Y G »,{<} 
yJV ct „(G„,<),{<} 



E 



yJV an (G n) <),{<} 



x x n 



G„,{do} 



+ LB G »'f" o} x /T» xn, 



(5) 



where the last inequality follows from the fact that N ari (G n ,Vo) is a tree and 
hence is a subtree of T n . This proves (jl]) since by assumption a n = O (logn). 



Although the assumption in the above theorem, may seem to be too restric- 
tive but it is satisfied by many examples including the n-cycle (see Subsection 
13. 2p . The method of the proof on the other hand, helps us to generalize the 
result for a large class of graphs including certain random graphs. 

Following Aldous and Steele [T], we say a sequence of rooted random or 
deterministic graphs {(G„, Vq )} n>1 with roots {vq} 71>1 converges to a random 
or deterministic graph (Goo,^) in the sense of local weak convergence (l.w.c) 

and write (G n , Vq) (Goo, if for any d > 1, 

P (Nd (G„, <) £ N d (Goo,v^)) — 4 1 as n ~> oo . (6) 

Note that for a deterministic sequence graphs, ([6]) really means that the event 
occurs for "large" ' enough n. 

Theorem 3 Let {(G„, ^o)} n >i ^ e a sequence of rooted deterministic or random 
graphs with deterministic or randomly chosen roots {^o}„>i- Suppose that for 
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each G n the maximum degrees of a vertex is bounded by A. Suppose there is a 
rooted deterministic or random tree T with root <h such that 



(G n , Vq) Lw - c -> (J ; 0) as n -» oo . 



(7) 



Let LB 



E 



of the graph G n . 
Then for j3 < 



E 



yT„,{<} 



where T n is a BFS spanning tree rooted at Vq 



as n — > oo . 



LB G n ,{v y 

Moreover for (3 < A we also get 

lim LB "'^ = lim E ryC„,{<}l = E \y 7 ^] 



(8) 



(9) 



Proof: Let Tn be a BFS spanning tree rooted at Vq of the graph G n and as earlier 
let LB G "' {l,o} = E [y r »Kl], Let £ n be the event [iV d (G„,^) AT d (T,0)]. 
Then from Theorem [1] 



LB 



G„,{«o} 



< E 



yG„,{<} 



E 



yG„,{<}-i 



E 



yG„,{<}-i 



(10) 



Now under our assumption, degree of any vertex of G n is bounded by A and 
f3 < so using Theorem 2.3 of [3] we get 



E 



yG^H"}! 



< 



1 

1-/3A 



P(^) • 



(11) 



Further note that Nd (G u ,Vq) is a tree rooted at Vq if £J n occurs and thus 
on E n , Nd (G n , Vq ) is a, sub-tree of T n - So 

Thus we get 

< E y JV <*( 7 «>'"o)>'K>'}i £ , + /3 rf E Y Gn ' d * Nd ( Gn ' v °* > 

< e y JV <*( 7 ~"' ,, ?)>W }\ E + (3 d ^ 



E 



yG„,{< 



< LB G "' {t, ° } +/3 fl 

< LB G "' {t, ° } 



1 



-E 



1-/3A 
1 

(l-/?Af 



1-/3A 

yGn.Kl 



(13) 



where d^Nd (G u ,Vq) denotes the infected vertices in G n after d units of time 
starting with one infection at vertex Vq. In the last two inequalities we use 
Theorem 2.3 of [3]. 
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So finally combining (|10p . (fTB")) and (fTTj) we get that for /3 < ^ an d for any 

d > 1 we have 



yG„,{<} _ LB G„,{«o}j < ^ 



1 



1 



d- f » r i-M PK) ' (14) 

Now under assumption ([7]), we have lim P (E%) = so we conclude that for 

n— >oo 

any d > 1 



limsup(E F G *'W> -LB G "' {t ' o} ) < /3 d 



(1-/3A)' 



(15) 



This proves (jSJ by taking d — ► oo as (i < 1. 

Now for proving ([§]) we first note that from definition 



E 



Y N d(G n ,v^),{v^} 1 



E„ 



E 



P (£„) ■ 



Thus arguing similar to the derivation of the equation (| 14[) we get 



E 



y-G n ,{t#}j _ E ryiV d (T,0),{0}p \ 



< F 



(1-/3A) 2 1-/3A 



(16) 



P(^n) • 



(17) 



Now from (JJJ we conclude that the degree of any vertex of T is also bounded 
by A. So using Theorem 2.3 of [3] we get that for f3 < 



E 



yiV d (T,^),{0} 



< 



1 



1-/3 A 



Moreover from definition, F^P^.W | y^,0 ^ d ^ So by MCT we get 



E 



Y 



< 



1 



1-/3A 



(18) 



Now under our assumption (J7]) we have P (E n ) — > 1. So from (|T7|) we conclude 
that 



lim E Y G "' {v o } 
n— >og 

Thus using ©, it follows that 

lim LB G "' {vo} = lim E 
This completes the proof. 



E 



Y 



To 



(19) 



: E [Y 7 '*] 



Following is immediate but interesting application of the above theorem 
which gives an explicit formula for the limit of epidemic spread on a randomly 
selected r-regular graph when infection starts from an randomly chosen vertex. 
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Theorem 4 Suppose G n is a graph selected uniformly at random from the set 
of all r -regular graphs on n vertices where we assume nr is an even number. 
Let Vq be an uniformly selected vertex of G n . Then for j3 < 



1 



lim E 



Y G "'{ v o} 



1 + /3 



1- (r-l)j8 ' 



(20) 



We note that in this case, the upper bound given in [3] is x \p when /3 < £ 
which is strictly bigger than the exact answer given in (|20p. 
Proof: It is known [5J [T] that if G n is a graph selected uniformly at random 
from the set of all r-regular graphs on n vertices, where nr is even and v$ be a 
randomly selected vertex of G n then 

(G„,<) l^(T r ,4>) , (21) 

where T r is the infinite r-regular tree with root say <f>. The result then follows 
from Theorem |3] and equation 



2.2 Starting with More than One Infected Vertices 

Now suppose instead of one infection, we start with k infected vertices given 
by / := {uo,i, fo,2) • " > ^o.fc}- Following theorem gives a lower bound similar to 
that of Theorem [T] 

Theorem 5 Let G be an arbitrary finite graph and I := {vo.j}j =1 be a fixed 
set of k vertices. Let T be a spanning forest of the connected components of G 
containing the vertices in I with exactly k trees which are rooted at the vertices 
in I . Then 

E [Y TJ ] < E [Y GJ ] for all < (3 < 1 . (22) 

Moreover, if T is a breath-first-search spanning forest of the connected compo- 
nents of G containing the vertices in I with exactly k trees which are rooted at 
the vertices in L then 

E [Y TJ ] < E [Y TJ ] < E [Y GJ ] for all < /3 < 1 . (23) 

Given a finite labeled graph G and a fixed set of vertices / = {vo,j}^ =1 of it, 
by a breath-first-search spanning forest of the connected components of G con- 
taining the vertices in / with exactly k trees which are rooted at the vertices 
in /, we mean a spanning forest of G with exactly k connected components 
which are rooted at the vertices {«o,i, vq.2, • • • , vo,k}, that are obtained through 
the breath-first-search algorithm, starting at some vertex v € / and assuming 
that all the vertices {i>o,i, Vo t 2, ■ • ■ , vo,k} are at the same level. Alternatively, we 
can consider a new graph G* which is same as G except it has one "artificial" 
vertex, say v* which is connected to the vertices Uo,i,«o,2; ■ • • 7^0, fc through k 
"artificial" edges and we perform the BFS algorithm on G* starting with the 
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vertex v* obtaining a BFS spanning tree, say T* of G* rooted at v*. Then a 
breath-first-search spanning forest of G with exactly fc trees which are rooted 
at the vertices {uo.ijWo^j ■ • ■ , «o,fc} is given by the forest T* \ {v*}. This al- 
ternative description is quite useful in practice. Note that if {Ti} 1<i<k be the 
fc connected components, rooted respectively at {t>o,i, i>o,2, ■ ■ ■ , vo,k} of T, then 
from definition, the following identity holds for every j3 G (0, 1) 

E [Y^] = £ E [Y^] = E[yT '^ 1] ' 1 . (24) 

i=i ' 

Using the above identity, we can now generalize all the results of the previous 
section for epidemic spread starting with more than one infected vertices. 

We write LB G/ for E f^ 7 "' 7 ] which is the lower bound for starting with k 
infected vertices given by /. Observe that from equation (|24| one can represent 

fc 

LB GJ = J2V[Y %J ] , (25) 
»=i 

fc 

where T = U T% is as above. It is worth nothing here that the lower bound 

i—l 

LB G/ does not depend on the choice of T but the representation given in 
equation (|23|) uses a specific choice of T ■ 

Theorem 6 Let {{G n , I n )} n>1 be a sequence of graphs where each G n has k- 
roots given by I n :— |wq i> v o 2) ' ' ' i^ofc} suc ^ that there exists a sequence a n = 

O(logn) with N an (G n ,I n ) := U N a (G n ,Vn ,,) is a forest with k components. 

. =1 ,jj 

Then there exists < /?o < 1, such that for all < j3 < /3o 

— 1 „ -. J — >■ 1 as n — t 00 . 26 

The proof of this result is exactly similar to that of Theorem [5] when one uses 
the identity (|24|) . The details are thus omitted. 

Our next result is parallel to the Theorem [3] which needs a generalization of 
the concept of local weak convergence which was introduced by Wastlund [8] . 

We will say a sequence of random or deterministic graphs {G n } n>1 with 

k roots given by /„ := |^o,i' u o,2j ' ' ' ' u ofc}' n — ^ converges to a random or 

deterministic graph G M with fc-roots say := |wo°i, wg° 2 , • • • , i>o° fc | in the 

sense of local weak convergence (l.w.c) and write (G n ,I n ) Lw ' c ") (Goo, loo) if for 
any d > 1 

P (TVd (G n ,v$ d ) = iVd (Goo, «g^) for all 1 < j < fc) — > 1 as n -> 00 . (27) 



Note that for a deterministic sequence graphs, (j27|) really means that the event 
occurs for "large"' enough n. 
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Theorem 7 Let (G n ) n>1 be a sequence of deterministic or random graphs. 
Suppose each G n has deterministic or randomly chosen k roots given by I n :— 
} and maximum degree of each G n is bounded by A. Suppose 

k 

j := U Tj is a forest with k rooted tress with roots 1^ := {(pi, q>2, • • • ><Pk}- We 
j'-i 

assume that 

{G n ,I n ) — ■ — ^ (T, loo ) as n y oo . (28) 

Then for (5 < i 

(E [F g - 7 »] - LB "' 7 ") — ► , (29) 

as n — > oo. Moreover 

k 

lim LB ' 1 ' 1 " — lim E [y G »' J «] = E [Y 7 ' 1 ™] = J^E [V^.W . (30) 

Proof: For each n > 1 as done above we define a new rooted graph G* with 
artificial vertex w* which is connected to the the fc-roots in I n of G n through k 
artificial edges. Also we consider T* defined similarly with an artificial root <jf 
connecting to {<fii, <f>%, ■ • ■ ,</>&}. Then our assumption of local weak convergence 
f2"8"]) is equivalent to 

(G* n ,v* n )^(T,P). (31) 
This together with the relation (f2"4l and Theorem [3] completes the proof. ■ 

It is worth noting that in case {Tj} 1<3<( . are i.i.d. (if they are random) or 
isomorphic (if they are constant) then equation (|30[) can be reformulated as 

lim LB G " J " = lim E [Y G " J "] = E [r TJ ~] = fcE [f^'M] . ( 32 ) 



n— >oo 



Like in the case of starting with one infection, following is an immediate 
application of the above results. 

Theorem 8 Suppose G n is a graph selected uniformly at random from the set 
of all r-regular graphs on n vertices where we assume nr is an even number. Let 
I n := |«o i, Vq 2 , ■ ■ • ,«o,fe} ^ e ^ uniformly and independently selected vertices of 
G n . Then for p < ± 

lim E \Y G ^] = k \±i - - . (33) 

n^oo 1 J 1 - (r - 1) j3 y ' 

Proof: Since the vertices in /„ are selected unformly at random so using [T] we 
get 

(G n ,I n ) ^> (T r> /oo) , (34) 

where 1^ :— {4>i,4>2, ■ ■• ,4>k\ and T r is a forest with k infinite r-regular tree 
with roots in 1^ . The result then follows from Theorems [7] and |4j ■ 
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Once again we note that in this case, the upper bound given in [3J 

for j3 < i, is strictly bigger than the exact answer given in (|33[) and the gap 
increases with fc, the initial number of infections. 



3 Examples 
3.1 Tree 

If G is a tree and the epidemic starts with only one infection at a vertex say 4> 
which we call the root, then from the construction of the lower bound it is clear 
that LB G '^ = E [y G 'W] . In certain cases one can find explicit formula for 
this quantity. Two such examples are discussed below. 



Regular Tree Consider a rooted r-array tree (r > 2), with height m, denote 
it by T (r, m). In T (r, m) every internal vertex except the root <fi has degree r. 
A vertex v is said to be an internal vertex if it has a neighbor which is not on 
the unique path from v to cj). We assume that the degree of the root is (r — 1). 
Let fi m := E[Y~ T ( r ' m )-W]. Note that the total number of vertices in T(r,m) is 
- — — j— -. Now, to calculate the exact value of /x m we note that 



fi m = 1 + (r - 1) /3/x„ 



which gives the formula 



f'n 



[(r-l)fl 



m+l 



- 1 



(35) 



(36) 



(r-l)p-l 

As T(r,m) is a tree so the lower bound is exact, that is, LB T< - r ' m - l: ^ = /i m . 
Now upper bound from [3] is yz^rg which only works for j3 < i. If j3 < ^ then 
by Theorem [3] we get 



E 



yT(r),{*} 



= lim n m = 



1 - (r- l)/3 ' 



(37) 



where T (r) is the rooted infinite r-regular tree, where each vertex except the 
root <fi has degree r and the root degree is (r — 1). 

We observe a gap between the lower bound (which in this case the exact 
answer) to that of the upper bound from [3] . 

Now let T r be the infinite r-regular tree where each vertex including the root 
has degree r. Such a tree can be viewed as disjoint union of r rooted infinite 
r-regular trees whose roots are joint to the root, say <fi of T r . Thus from 
we get that for /3 < - 



= 1 



1 + /3 



l-(r-l)/3 l-(r-l)0 



(38) 
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Galton- Watson Tree Consider a Galton- Watson branching process start- 
ing with one individual. Let the mean of the offspring distribution be c > 0. 
We denote the random tree generated by this process as GW (c) with root 
4>. Once again, as discussed above since GW(c) is a tree, so LB GW(c) < w = 
g |^yGW(c),{0}j _ n ow in this case, the epidemic process starting with only 
one infection at (j>, results to a Galton- Watson branching process starting with 
one individual as the root and with mean of the new progeny distribution as 

f3c. So in particular if /3 < - then from standard branching process theory 
E [yGW(c),{0}] < ^ and ig eq ^ al tQ _j_ ^ 

3.2 Cycle 

Cycle graph is a graph that consists of a single cycle. We denote the cycle with 
n vertices by C n . For simplicity we assume n is odd and then from the BFS 
algorithm, it is immediate that starting with one infected individual, say at Vq, 
we have 

LB Cn < {vo} = 1 + 2 (p + ^ + '-'+P^) (39) 



which converges to as n — > oo for any < < 1. Now it is clear from the 
definition that 

Thus using Theorem [3] we conclude that if /3 < \ then 



(C„,<)^(Z,0). (40) 



lim LB c "' {u ° } = lim E [y c ™.W} 



But, in fact this holds for any < f3 < 1. This because for cycle, the assumption 
in Theorem [5] holds for a n — n/3 (say). Thus using the proof of Theorem [5] we 
conclude that the equation (|4"l"j) holds for any < f3 < 1. 

Now if the epidemic starts with k initial infected vertices given by /„ :— 

1^0,1 > v Q,2i ' ' ' i v 0,k 

} which arc uniformly distributed, then it is easy to see that 

(C n ,J n ) (Z,-, ()),<. (42) 

where Zj is just a copy of Z. Then by Theorem[7]we conclude that for < (3 < |, 

lim LB c,l,/ " - lim E \Y C ^] = k . (43) 

But in fact, because of exactly similar reason given above, we can use Theorem 
©with a n = O (n) to conclude that (gSJ) holds for all all < < 1. 

3.3 Generalized Cycle 

Suppose in a cycle graph we choose randomly without replacement 2m vertices 
and connect these vertices by joining edges between them where to > 1 is 
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fixed. We call this graph a Generalized Cycle and denote it by GC (n, m). Now 
consider the epidemic model on this graph with one initial infected site Vq. For 
large enough n, the probability of having at least one of the m pairs inside a 
neighborhood of Vq with radius r is given by 



1 



1 - 



2r(2r + 1) 
n(n — 1) 



which tends to zero as n — > oo. Therefore, a fixed neighborhood of the root is a 
tree with high probability, in fact it is isomorphic to a neighborhood of integer 
line. Hence by Theorem [3] it follows that for j3 < h 



lim LB GC (™' m ^ ,, °} = lim E [y G0 (' l > m )>W > 



1 + /3 
1-/3 

i k 



(44) 



Similarly if we start with k initial infected sites, say /„ := {«q j} - —1 which are 
chosen uniformly at random, then it is easy to see that 



(GC(n,m) ! J„)^>(Z i) 0) 1 <.< fe , 
where Zj is just a copy of Z. Thus by Theorem [7] we get 



lim LB GC ' n,m ' ,/ " = lim E fy«3MA 



= fc 



1 + /3 



when /3 < g, because the maximum degree in GC (n, m) is 3. 



(45) 



(46) 



3.4 Cube graph 

Cube graph is the graph obtained from the vertices and edges of the 3-dimensional 
unit cube. We denote it by Q 3 . Suppose initially only the vertex (0,0,0) is in- 
fected. Consider a BFS spanning tree T of Q 3 rooted at (0,0,0). Since Q3 has 
only 8 vertices so F r ' {(0 '°' 0)} takes values {0, 1, 2, 3, 4, 5, 6, 7} and 



LB 



T,{(0,0,0)} _ 



E 



yT,{(0,0,0)} 



l + /3(l-/?) 2 [l + (l-/?) + (l-/3) 2 ] 

+4/3 2 [(l-/3) 2 + (l-/3) 3 + (l-/3) 4 ] 

+3/3 3 [(l -13) + (1 - /?) 2 + 6(1 - (3) 3 + (1 - (3) 4 

+4/3 4 [6(l-/3) 2 + 4(l-/3) 3 ] 

+5/3 5 [2(l-/3) + 6(l-/3) 2 ] 

+24/3 6 (l - /?) 

+7/3 7 

1 + 3/3 + 3^ 2 + /3 3 
(1 + /3) 3 • 
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In general, the ci-dimensional cube graph say Qd is a d-regular graph which 
has n = 2 d vertices. Following a similar calculation as done above, one can 
show that for an epidemic starting at one vertex, the lower bound obtained in 
Theorem [T] for the expected total number of vertices ever infected is given by 
(1 + /?)<*. 

In this example computation of the exact value of E l"yQ<i,{(o,o,o)}j j s difficult, 
but we note that there is a gap between the upper bound obtained in [3] , namely 
— L_ which is valid only when /3 < i and our lower bound. Although this is an 
example which does not fall under any of the theorem we discuss in this paper 
and hence we are not sure if the lower bound gives better approximation. 



4 Discussion 



The goal of this study has been to get a better idea of the expected total number 
of vertices ever infected with as little assumption as possible on the underlying 
graph G. Our approach has been to find an appropriate lower bound to this 
expectation. Although from practical point of view, approximation from above 
with an upper bound is a more conservative method. As shown in the examples 
given in Section [3l the only known upper bounds obtained in [3] often over 
estimate the exact quantity Moreover the upper bounds in [3] are defined only 
for "small" values of the parameter /3. For an arbitrary finite network, we have 
obtained a lower bound for the expectation of ever infected vertices for any 
value of the parameter /3 which is computable through the breadth-first search 
algorithm. Theorems O El El an d El show that this lower bound is asymptotically 
exact for a large class of graphs when f3 value is "small" , which always includes 
the region on which the upper bounds from [3] are defined. 

However, we would also like to mention here that even though the lower 
bound we present, works for any infection parameter < /3 < 1 but if the 
underlying graphs has many loops, such as the complete graph K n , then it does 
not necessarily give a good approximation. To see this, consider the complete 
graph K n and suppose that the epidemic starts at a fixed vertex vq. Then the 
lower bound \jQ Kn ^ v °^ = 1 + (n — 1) /3. Now, let X\ be the number of infected 
vertices at time t = 1. In this case it is easy to see that X\ ~ Binomial (n — 1, (3). 
Let u be one of n — 1 — X\ vertices which are not infected at time t = 1. Since 
K n is the complete graph, so the conditional probability of u becomes infected 
at time t = 2 given X% is 1 — (1 — 0) Xl . Hence 



E 



y#„,{vo} 



> 1 + (n - 1)13 + E Un - 1 - (l - (1 - 

= 1 + (n - 1) /3 + (n - 1) - (n - 1) (l - 

- (n - 1) + (n - 1) (3 (1 - (3) (l - /3 2 ) 



Therefore we get 



lim sup 



E [ytfn.^o}] _ LB 



LB 



K n ,{v Q } 



> 



(47) 
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where LB K "' {v ° } := E[y r »>W >]. 

Here, it is worth mentioning that for the complete graph if we start with one 
infected vertex, then as discussed in Section [T] the set of ever infected vertices 
is no other than an Erdos-Renyi random graph with parameter n and j3. Thus 
asymptotic behavior of E Pk- k "'»>{ ,, °}1 is well understood in the literature [41 [TTj. 
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